Nonparametric Bayesian Co-clustering Ensembles
نویسندگان
چکیده
A nonparametric Bayesian approach to co-clustering ensembles is presented. Similar to clustering ensembles, coclustering ensembles combine various base co-clustering results to obtain a more robust consensus co-clustering. To avoid pre-specifying the number of co-clusters, we specify independent Dirichlet process priors for the row and column clusters. Thus, the numbers of rowand column-clusters are unbounded a priori; the actual numbers of clusters can be learned a posteriori from observations. Next, to model non-independence of rowand column-clusters, we employ a Mondrian Process as a prior distribution over partitions of the data matrix. As a result, the co-clusters are not restricted to a regular grid partition, but form nested partitions with varying resolutions. The empirical evaluation demonstrates the effectiveness of nonparametric Bayesian co-clustering ensembles and their advantages over traditional co-clustering methods.
منابع مشابه
Nonparametric Bayesian Models for Unsupervised Learning
NONPARAMETRIC BAYESIAN MODELS FOR UNSUPERVISED LEARNING Pu Wang, PhD George Mason University, 2011 Dissertation Director: Carlotta Domeniconi Unsupervised learning is an important topic in machine learning. In particular, clustering is an unsupervised learning problem that arises in a variety of applications for data analysis and mining. Unfortunately, clustering is an ill-posed problem and, as...
متن کاملNonparametric Bayesian Clustering Ensembles
Forming consensus clusters from multiple input clusterings can improve accuracy and robustness. Current clustering ensemble methods require specifying the number of consensus clusters. A poor choice can lead to under or over fitting. This paper proposes a nonparametric Bayesian clustering ensemble (NBCE) method, which can discover the number of clusters in the consensus clustering. Three infere...
متن کاملComputationally efficient methods of clustering ensemble construction for satellite image segmentation
Combining multiple partitions into single ensemble clustering solution is a prominent way to improve accuracy and stability of clustering solutions. One of the major problems in constructing clustering ensembles is high computational complexity of the common methods. In this paper two computationally efficient methods of constructing ensembles of nonparametric clustering algorithms are introduc...
متن کاملBayesian Framework for image segmentation Based on Nonparametric Clustering with Spatial Neighborhood Information
In this paper, we present a Bayesian framework for image segmentation based upon spatial nonparametric clustering. To estimate the density function on a nonparametric form, the 1 / 4
متن کاملGender-based Differences in Associations between Attitude and Self-esteem with Smoking Behavior among Adolescents: A Secondary Analysis Applying Bayesian Nonparametric Functional Latent Variable Model
Background: Different patterns of gender-based relationships between attitude toward smoking and self-esteem with smoking behavior have reported. However, such associations may be much more complex than a simply supposed linear relationship. We aimed to propose a method of providing hand details on the total and gender-based scenarios of the relationships between attitude toward smoking and sel...
متن کامل